GOM-Hadoop: A distributed framework for efficient analytics on ordered datasets
نویسندگان
چکیده
منابع مشابه
GOM-Hadoop: A distributed framework for efficient analytics on ordered datasets
One of the most common datasets exploited by many corporations to conduct business intelligence analysis is event log files. Oftentimes, the records in event log files are temporally ordered, and need to be grouped by certain key with the temporal ordering preserved to facilitate further analysis. One such example is to group temporally ordered events by user ID in order to analyze user behavio...
متن کاملHADOOP: A Framework for Distributed Computing
With data growing so rapidly and the rise of unstructured data accounting for about 90 % of the data today, the time has come for the enterprises to re-evaluate their approach to data storage, management and its analysis. This enormously growing data has been given the name Big Data. Hadoop platform has been designed to tackle the problems associated with handling such an enormous data-that doe...
متن کاملHadoop Mapreduce Framework in Big Data Analytics
As Hadoop is a Substantial scale, open source programming system committed to adaptable, disseminated, information concentrated processing. Hadoop [1] Mapreduce is a programming structure for effectively composing requisitions which prepare boundless measures of information (multi-terabyte information sets) inparallel on extensive bunches (many hubs) of merchandise fittings in a dependable, sho...
متن کاملEfficient hybrid search algorithm on ordered datasets
The increase in the rate of data is much higher than the increase in the speed of computers, which results in a heavy emphasis on search algorithms in research literature. Searching an item in ordered list is an efficient operation in data processing. Binary and interpolation search algorithms commonly are used for searching ordered dataset in many applications. In this paper, we present a hybr...
متن کاملA Hadoop-based Distributed Framework for Efficient Managing and Processing Big Remote Sensing Images
Various sensors from airborne and satellite platforms are producing large volumes of remote sensing images for mapping, environmental monitoring, disaster management, military intelligence, and others. However, it is challenging to efficiently storage, query and process such big data due to the dataand computingintensive issues. In this paper, a Hadoop-based framework is proposed to manage and ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Journal of Parallel and Distributed Computing
سال: 2015
ISSN: 0743-7315
DOI: 10.1016/j.jpdc.2015.05.003